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Abstract 

Let Aq be a g-letter alphabet and tc be a right infinite word on this alphabet. A 
subword of u) is a block of consecutive letters of w. The subword complexity function 
of w assigns to each positive integer n the number /«,(n) of distinct subwords of length 
n of w. 

The gap function of an infinite word over the binary alphabet {0,1} gives the 
distances between consecutive I’s in this word. In this paper we study infinite binary 
words whose gap function is injective or ’’almost injective”. A method for computing 
the subword complexity of such words is given. A necessary and sufficient condition for 
a function to be the subword complexity function of a binary word whose gap function 
is strictly increasing is obtained. 


1 Introduction 

The subword complexity (sometimes called symbolic complexity) of finite and infinite words 
became an important subject in Combinatorics on Words recently. Applications include 
Dynamical Systems, Ergodic Theory and Theoretical Computer Science (jH])- 

For a given inhnite word it is not easy to compute the subword complexity function. 
Classes of inhnite words whose subword complexity function has been computed include 
paperfolding sequences (see [T]), Rudin-Shapiro sequences, Thue-Morse sequences and gen¬ 
eralized Time-Morse sequences (see [10]) and sequences dehned by billiards in hypercubes, 
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which generalize Sturmian words. A survey of results of this kind can be found in j2] and 
[3], with 13] being the most recent. 

Another general problem of much interest is to determine which function can be the 
subword complexity function of an inhnite word. A list of known necessary conditions as 
well as a list of sufficient conditions is given in [Hj. 

If u and V are two hnite words over the same alphabet, then uv will denote the concate¬ 
nation of u and v. In particular, for a positive integer n, = uu...u {n times), = e, 
where e is the empty word. 

Let w be an inhnite word over the binary alphabet A = {0,1}. Since the subword 
complexity of a binary word does not change when we interchange 0 and 1, we can assume 
without loss of generality that w contains an inhnite number of I’s. 

Definition 1.1. The function G : N —M is called the 1-distribution function of w if G{i) is 
the position of the ith 1 in w. By convention G(0) = 0. The function g{i) = G{i) — G{i — 1) 
dehned for i > 1 is called the gap function. If g is strictly increasing, then w is said to be 
gap increasing. 

The 1-distribution function G{i) is called sometimes the occurrence function of a letter 
(see j3j). The main result of Section |2] is the following theorem. 

Theorem 1.2 (The Subword Complexity of Gap Increasing Words). Let w be a gap 

increasing word with 1-distribution function G and gap function g. If n < g{l), then the 
subword complexity function of w is fw{n) = n + 1, otherwise 

fwin) = G{Ln) + G{Ln -|- 1) — G{Mn -|- 1) -|- n{Mn — Ln) + U + 

where is the least non-negative integer which satisfies 

g{Ln + 1) + g{Ln + 2) > n -f-1 

and Mn is the maximum non-negative integer which satisfies 

g{Mn -h 1) < n - 1. 


Given a function g that satishes the hypothesis of Theorem II.2L we always can compute 
Mn and in 0{log{n)) time. 

We use Theorem o to hnd the asymptotic behavior of the subword complexity function 
of inhnite words over A = {0,1} with polynomial and exponential 1-distribution functions. 
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Let A; > 1 be an integer. If the 1-distribntion fnnction of w is G{n) = n^, then fw{n) = 
0(nfe^). If the 1-distribntion fnnction of w is G{n) = then fw{n) = 0(n). 

In Section El we give a recnrrence formnla for the snbword complexity fnnction of inhnite 
binary words whose gap fnnction is injective and later, in Section 0 generalize it to binary 
words whose gap fnnction is blockwise injective. A fnnction h : N —>■ M is called blockwise 
injective if h{i) = h{j) for i < j implies h{i) = h{i -|- 1) = ... = h{j). We prove that if w has 
the gap fnnction g{n) = cn + d, where c,dEN and c > 2, then fw{n + 2c) = fwin) + n + 2c 
for n > c + d. 

In Section E] we derive necessary and snfficient conditions for a fnnction / : N —> N to 
be the snbword complexity of an inhnite gap increasing word. First we prove the following 
necessary condition; /(I) = 2, /(2) = 3 and |A^/(n)| < 1 (A is the nsnal difference 
operator). 

To formnlate the necessary and snfficient condition, we introdnce additional notation. Let 
and be two strictly increasing seqnences that have no elements in common, 

then U will denote the strictly increasing seqnence that consists of all the 

elements of both seqnences. We allow a seqnence to have no elements (be empty). If {yi\i>i 
is empty, then U {yi}i>i = {xi}i>i. Also, if {xi\i>i is a strictly increasing seqnence 

of positive integers and p < xi be a non-negative integer, then crp({xj}i>i) will denote the 
strictly increasing seqnence {p + xi + l,x\ + X 2 + 1 ^X 2 + X'i + ...}. 

Theorem 1.3 (Necessary and sufficient condition for a function to be the subword 
complexity function of an infinite gap increasing word). Let f : Z_|_ —> Z_|_ be such 
that /(I) = 2, /(2) = 3 and |A^/(n)| < 1 (this is a necessary condition). Set 
to the sequence obtained by arranging the elements of the set {n G Z_|_|A^/(n) = 1} in 
strictly increasing order. Similarly is obtained by arranging the elements of {n G 

Z_|_|A^/(n) = —1} in strictly increasing order. Then f is the subword complexity function of 
an infinite gap increasing word if and only if {ai}i>i is not empty and there exist an integer 
p, 0 < p < ai, and a strictly increasing sequence of positive integers {cj}i>i, with no elements 
in common with either or such that U {cj}i>i is infinite and 


Based on Theorem o in Section E] we describe a practical method to determine if a 
fnnction / is the snbword complexity fnnction of an infinite gap increasing word and give 
examples (see Examples 14.141 and I4.15|l . We also show that it is not trne in general that a 
fnnction f : N ^ N cannot be the snbword complexity fnnction of two distinct infinite gap 
increasing words. 


3 






Many sequences that have been studied thoroughly (like Sturmian, Arnoux-Rauzy and 
Rote and Rudin-Shapiro sequences) have affine or ultimately affine subword complexity 
functions (see 171, i, i, i ). An inhnite word w has subword complexity 0(n) ultimately 
if there exists a positive integer N, such that for all integers n > N, the subword complexity 
function of w is 4>{n). 

Cassaigne proved in that if a and b are two integers, then there exists an inhnite 
binary word with subword complexity function an + b ultimately if and only if one of the 
two following conditions holds: 

1) a > 2; 

2) 0 < a < 1 and 6 > 1. 

In Section El we prove that for two integers a and b there exists a gap increasing word w 
with the subword complexity function fw{n) = an + b ultimately if and only if a > 2. This 
result is generalized in the next proposition, proved in Sectional 

Proposition 1.4. Let a and b he two integers. There exists an infinite binary word w with a 
blockwise injective gap function and subword complexity function /u,(n) = an + b ultimately 
if and only if one of the two following conditions holds: 

1) a>2. 

2) a = 0 and b> 1. 

In SectionEl in order to investigate when an+ 6 is ultimately the subword complexity of a 
gap increasing word, we show a geometric interpretation of the subword complexity of a gap 
increasing word. A consequence is that if w is gap increasing, then the subword complexity 
f oi w satishes n + 1 < f{n + 1) < \'n/2\ [n/2j + |'n/2"| + 1 and both bounds are exact. 

The main result of Section El is Proposition lb.21 which gives a method of computing the 
subword complexity of inhnite binary words whose gap function is blockwise injective. A 
consequence of Proposition lb.21 is that, if w is an inhnite binary word whose gap function is 
blockwise injective, then its subword complexity /„,(n) = 0{n?), more precisely 1 < /(n) < 
^ ^ + 1. Thus the topological entropy of w is 0. 

Since Proposition lb. 21 is rather big for an introduction section, here we present a method 
of computing the subword complexity function of inhnite binary words whose gap function 
is non-decreasing (a non-decreasing function is blockwise injective). 

To each unbounded non-decreasing function : M —>■ N we assign two functions j : N —>■ N 
and p : N —> N such that j is the strictly increasing function which assumes the same values 
and in the same order as g does, and, for each r G N, p{r) is the number of times g assumes 
value j{r). If p : N —>■ N is a bounded non-decreasing function, let b be the number of distinct 
values that g assumes. Then j(r) is dehned as above for 1 < r < 6, and p{r) is dehned for 
1 < r < 6 — 1. 
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Lemma 1.5. Let w be an infinite binary word whose gap function g is non-decreasing and 
iir), p{r) (and b if g is bounded) be defined as above. 

If g is unbounded, then, for every natural n, fw{n-\-l) — fw{n) eguals one plus the number 
of integer r solutions of 


j{r) < n 

j{r — 1) + p{r)j{r) > n + 1. 


( 2 ) 


If g is bounded, for every natural number n consider the ineguality system 


{ r < b 

j{r) < n (3) 

j{r — 1) + pir)j{r) > n + 1. 

//1 < n < j(^) — 1, then + 1) — fw{n) eguals one plus the number of integer r solutions 

of eg. If n > jib), then fw{n + 1) — /«,(n) is just the number of integer solutions of 

eg. 


2 Gap increasing words 

An infinite word is said to be gap increasing if its gap function is strictly increasing. The 
main result of this section is the following theorem. 

Theorem 2.1. Let w be a gap increasing word with 1-distribution function G and gap func¬ 
tion g. For n < g{l) the subword complexity function of w is fw{n) = n + 1. For n > g{l) 
the subword complexity function of w is 

fwin) = G{Ln) + G{Ln + 1) — G{Mn + 1) + n{Mn — Ln) + u + 1, (4) 

where L^ is the least non-negative integer which satisfies 

g{Ln + 1) + g{Ln + 2) > n + 1 (5) 

and Mn is the maximum non-negative integer which satisfies 

giM^ + l)<n-l. (6) 
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The proof of Theorem 12.11 follows from Propositions 12.21 and 12.21 proved later in this 
section. 

It should be mentioned that, given a function g that satishes the hypothesis of The¬ 
orem lO and n > g{l), we always can compute M„ and Ln in log{n) time. To see 
this notice that, since g is strictly increasing, g{i -|- 1) > i for all i > 0 and therefore 
g{i -|- 1) -|- g{i + 2) > 2z -|- 1 for all z > 0. This implies that <n and < n. We can use 
the dichotomy algorithm with initial value n for both and to hnd as the minimum 
solution of eq. and Mn as the maximum solution of eq. (0) (here we use again the fact g 
is strictly increasing). The running time for both is log{n). 

Proposition 2.2. Let w be an infinite gap increasing word. The number of subwords of 
length n of w that contain at most one 1 is n + 1. 

Proof. Let g be the gap function of w. Since g is strictly increasing, we can pick L such 
that g{L) > n+1. Then w' = is a subword w which contains all possible 

words of length n over {0,1} that have at most one 1. There are npl such words, thus the 
number of subwords of w of length n that contain at most one 1 is n -|- 1. □ 

Proposition 2.3. Let w be an infinite gap increasing word with gap function g and 1- 
distribution function G. The number of subwords of w of length n that contain at least two 
1 ’s is G{Ln) + G{Ln -|- 1) — G{Mn -|- 1) -|- n{Mn — Ln) if n > g{l), where Ln and Mn are 
defined in Theorem \2.1\ and 0 if n < g{l). 

Proof. Fix a positive integer n. Let Vi be the subword of length n oiw which occurs at place 
i m. w and S be the number of subwords of w of length n that contain at least two I’s. 

Since g is strictly increasing, all subwords of w which contain at least two I’s, occur just 
once in w. Thus S equals the number of z’s for which Vi contains at least two I’s. 

Let N be the least integer such that vn contains at most one 1. Then the number of 
subwords of w of length n that contain at least two Vs S = N — 1 + K, where K is the 
number of z’s such that Vi contains at least two I’s and i > N. 

First we hnd N. There are two possibilities: either iV = 1, which happens when n < ^'(l) 
and then S' = 0, or the (A^ — l)th character of tc is 1 (otherwise vat-i would contain at most 
one 1, which would contradict the choice of N). In the last case N — 1 = G{Ln) for some 
positive integer L„. 


Vn 


w = ... 01 100 

g{Ln + l) — i 


,0 0 ... 010 


(7) 


Because N = G{Ln) -|- 1 is the least integer for which vn contains at most one 1, we have 
that Ln is the least integer which satishes the inequality g{Ln -|- 1) -|- g{Ln + 2) — 1 > n. 
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Next we find the number K of i’s such that i > G{Ln) and Vi contains at least two 
I’s. All such ViS contain exactly two I’s (otherwise there would exist an / > such 
that g{l + 1) + 5 f(/ + 2) + 1 < n which is impossible because g{l + 1) + (/(Z + 2) + 1 > 
g{Ln + 1) + g{Ln + 2) + 1 > n). 

Let P be the maximum integer for which vp contains exactly two I’s. Clearly the hrst 
character of up is 1, otherwise vp+i would contain two I’s too, which would contradict the 
choice of P. So P = G{Mn) where is the maximum integer which satishes g{Mn + l) + l < 

n. 

Hence K is the number of Uj’s, G{Ln) < i < G{Mn), that contain exactly two I’s. For 
each integer I, Ln < I < we compute the number of i’s in the interval G{1 — 1) < i < G{1) 
for which Vt contains two I’s. If G{1 — 1) < i < G{1), then, as shown in (jS)), the only two I’s 
that Vi can contain are the G{l)th and G{1 + l)th characters of w. 


Vi 

, -^-s 

g(i+l)+l 

w; = ...0 1 0 ... 00 0 ... OO' 1 00 . .^00 1 'OO ... 0 0 ... 0 10... (8) 

G{1-1) G(l) G(l+l) G(l+2) 

Thus Vi contains two I’s if and only if G{1 + 1) — n + 1 < i < G{1) and there are 
G{1) — G{1 + 1) + n such ViS. 

The total number of ViS which contain two I’s and such that i > G{Ln) is 

Mn 

K = ^ ^ {G(^l) — G(Z + 1) + Ti) = G{^Ln + 1) — G(^Adn + 1) + — T„). 

l=Ln-\-l 

Thus the number of subwords of w of length n that contain at least two I’s is 
S = N-1 + K = G{Ln) + G{L^ + 1) - G{Mn + 1) + n(M„ - L„), 

which is the claim of the Proposition. □ 

Next we use Theorem o to hnd the asymptotic behavior of the subword complexity 
function of inhnite words with polynomial and exponential 1-distribution functions. 

Lemma 2.4. Let k > 1 be an integer and w be an infinite word whose 1-distribution function 
is G{n) = . The subword complexity function of w is 

/^(n) = 0(n^), 


where the Q-notation depends on k. 
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Proof. The gap function of w is g{n) = —(n—1)^ and eq. (0) becomes (/+2)^—> n+1. If 

n > k2^, then any I which satisfies eq. (0 is greater than 2, thus (/ + 2)^ — < 2/i;( = 

which yields > (n±i)i/(fe-i)_ Qn the other hand (Z + 2)^ — P > 2kP~^, so 

^ Hence L„ = ©(n^^), where the ©-notation depends on k. In the 

1 k 

same way = ©(?7,“). By Theorem 12.11 L.fri) = ©(n*^). □ 

Lemma 2.5. For an integer k > 1 eonsider the infinite word with 1-distribution funetion 
G{n) = The subword complexity function of w 

fw{n) = ©(n), 

where the Q-notation does not depend on k. 

Proof. Solving eq. © and Q for and Mn respectively we get Ln = 1 + [log;. 

Mn = l+ [logfc ffij. By Theorem EII]/«)(^) = 

k + k - A; ^J +' + n( 

As to the asymptotic behavior, fw{n) = ©(n). □ 

3 Recurrence formulas 

The formula that we obtained in Theorem o can be used to deduce a recurrence for the 
subword complexity function of an inhnite gap increasing word. However a combinatorial 
approach that we present next leads to a more elegant form of recurrence and applies to a 
larger class of inhnite binary words, those whose gap function is injective. In Section EJ the 
method of this section is generalized to inhnite binary words whose gap function is blockwise 
injective (the dehnition of blockwise injectivity will be given later, in Section IHl an instance 
of blockwise injective functions are the non-decreasing functions). 

The following dehnition applies to hnite as well as inhnite binary words. 

Definition 3.1. A subword u of a binary word w is called a right (left) special factor of w 
if both uO and ul (On and lu respectively) are subwords of w. Let Sw{n) denote the number 
of right special factors of w of length n and s(„(n) denote the set of all left special factors of 
w of length n. 


logfc 


n — 1 
k-1 


logfc 


n -I- 1 
A:2 - 1 


) n -Fl. 
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A very general recurrence formula for the subword complexity of an infinite binary word 
is 

f^(n + 1) = fw{n) + s^(n) (9) 

In Proposition Id.21 we show that the number s^{n) of right special factors of length n of 
an inhnite binary word w whose gap function g is injective equals the number of solutions 
of a certain system of inequalities involving g and n. In Proposition lb.21 we do the same for 
inhnite binary words whose gap function is blockwise injective. 

Proposition 3.2. Let w be an infinite binary word whose gap function g is injective. Then 
the number Su,{n) of right special factors of w of length n equals one plus the number of 
integer I solutions of 

" ( 10 ) 

l5'(^ - 1)+5'(0 >n + l. 


Proof. Since g is injective, any subword of w which contains at least two I’s occurs only once 
in w, thus this subword cannot be a right special factor. This implies that all right special 
factors of w contain at most one 1. Let m be a subword of w of length n which contains at 
most one 1. Since mO is always a subword of w, for w to be a right special factor it is enough 
that m 1 be a subword of w. Thus a subword m of tc is a right special factor of w if and 
only if it contains at most one 1 and ul is a subword of w. Hence Syjfn) equals the number 
of subwords of w of length n + 1 which contain at most two I’s and whose last letter is 1. 
Obviously O""! is such a word and is the only such word which contains less then two I’s. 
Thus Syj{n) is the number of subwords of w of length n + 1, which contain exactly two I’s 
and whose last letter is 1, plus one. 

We shall count the subwords of w of length n+1 of form v = where x = n—g{l) 

and 0 < X < g{l — 1) — 1. Since each such v occurs just once in w (because it contains two 
I’s), we count the number of Vs which satisfy the inequality 0 < u — g{l) < g{l — 1) — 1, that 
is the number of Vs which satisfy eq. p()|l . □ 

Corollary 3.3. Let w be a gap increasing word and L^ and be defined as in Theorem \2.1\ 
Then the recurrence formula for the subword complexity function of w when n > ^'(l) is 


fwip + 1) — fwip) + Mn+i — Ln + 1- 


For the values of n that are less than g{l), the recurrence is fwin + 1) = /«,(n) + 1. 

Proof. By Proposition 13.21 the number Sw{n) of right special factors of w of length n equals 
one plus the number of solutions of eq. (HDD. 
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If n < g{l), the number of solutions of eq. (fTn|l is zero. Thus s^{n) = 1 and fw{n + 1) = 
/^(n) + 1. 

Now consider n > g{l)- We make use of the fact that g is strictly increasing. g{l) < n if 
and only if I < Mn+i + 1, also g{l — 1) + g{l) > n + 1 if and only if Z > + 2. So there are 

Mn+i — Ln Z’s which satisfy eq. (uni). Thus, by Proposition Id.21 Sw{n) = Mn+i — Ln + I and 
fwin + 1) = f-win) + Mn_|_i — + 1. □ 

Example 3.4. Consider the infinite binary word w whose 1-distribution function is G{n) = 

. The gap function of w is g{n) = G{n) — G{n — 1) = 2n — 1. Since g is strictly 
increasing, we can use Corolla, ru \d . ,4 to compute the subword complexity ofw. The recurrence 
fw{n + 1) = fw{n) + Mn+i — Ln + I starts with n = 1 because g{l) = 1. 

Ln is the least I which satisfies 2Z + l + 2Z + 3>n + l, thus Ln = [(n — 3)/4"|. Mn is the 
maximum m which satisfies 2m + 1 < n — 1, thus Mn = \n/2\ — 1. By Corolla, rv VhtA 

+ 1) = U{n) + [{n + l)/2j - [(n - 3)/4] , 

fw{n + 4) — fu,{n) = Mn+l + Mn+2 + Mn+3 + Mn+4: ~ Ln + Ln+1 + -hn+2 + -hn+3 + 4 = 

\{ri + l)/2j + [(n + 2)/2j + [(n, + 3)/2j + [(n + 4)/2j — 

[(n — 3)/4] — [(n — 2)/4] — \{n — l)/4] — |'n/4] = n + l+ n + 3 — n = ?7, + 4. 

We get an elegant recurrence for the subword complexity function ofw: 

fn,{n + 4) = /^(n) + n + 4. 

Next we generalize this result to all inhnite words whose gap function is linear. 

Lemma 3.5. Let c > 0 and d be two integers such that c + d > 0. The subword complexity 
function of the infinite binary word w with gap function g{n) = cn + d satisfies the recurrence 
fw{n + 2c) = fw{n) + n + 2c for n > c + d. 

Proof. After solving eq. © and we get Ln = [(n + l — c — 2d)/2c] — 1 and = 
[(n — d — l)/cj — 1. Notice that 

2c 2c 

fw{^^ T 2c) fwip) ^ ^ Mn-\-i ^ ^ Ln-\-i—\ T 2c 

[{n — d)/cj + ... + [(n + 2c — d — l)/cj — [(n + l — c — 2d)/2c] — ...— 

\{n + c — 2d)/2c] + 2c = (n — d) + (n — d + c) — (n + c — 2d) + 2c = n + 2c. 

So fw{n + 2c) = fw{n) + n + 2c. □ 
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4 Necessary and sufficient conditions 

In this section we give necessary and snfficient conditions for a fnnction / : —>■ 

to be the snbword complexity of an inhnite gap increasing word. We use the notations 
^f{n) = f{n + 1) - f{n) and A‘^f{n) = Af{n + 1) - A/(n). 

The next proposition gives a necessary condition for a function to be the subword com¬ 
plexity function of an inhnite binary word whose gap function in injective. A stronger 
condition is obtained for gap increasing words. 

Proposition 4.1. Let w be an infinite binary word with gap function g and subword com¬ 
plexity function f. If g is injective, then 1 < Af{n) < n -|- 1. If g is strictly increasing, then 
1 < Af{n) < [n/2\ + 1. 

Proof. Clearly A/(n) is the number of right special factors of length n of w. 

Let g be injective, then it follows from the proof of Proposition Id.21 that the number of 
right special factors of w of length n equals one plus the number of subwords of w of form 
V = 0^10^1, for some x > 0 and |/ > 0 such that x + y = n — 1. Thus 1 < A/(n) < n -f 1. 

If g is strictly increasing, then 0 < x < y, hence x can take at most [n/2j different values 
and 1 < Af{n) < [n/2j -|- 1. □ 

Corollary 4.2. Let w be an infinite gap increasing word with subword complexity function 
f, then n + 1 < f{n) < \n/2'] [n/2\ + \n/2] 1. 

Proof. This follows from f{n) = 2 YllZi Af{i) and 1 < Af{i) < \i/2\ + 1 for all i. □ 

Remark 4.3. The lower bound in Corollary is exact: for any positive integer N there 

is an infinite gap increasing word w with the property that f{n-\-l) = n + 2 for 0 < n < N 
(w could be any gap increasing word with prefix 0^1 where k > N — 1). However there is 
no infinite gap increasing word such that f{n+l) = n + 2 for all n, that is an infinite gap 
increasing word cannot be Sturmian. The upper bound in Corollary \4.^ is also exact (this 
will be shown in Lemma ro) . 

Definition 4.4. We will say that an inhnite gap increasing word w has a double gap of 
length n if there exists an integer m > 0 such that either is a subword of w or 

is a prehx of w, in other words there exist two consecutive ’’gaps” (blocks of O’s 
between I’s or before the hrst 1) whose length sum is n. 

Lemma 4.5. Let f be a subword complexity function of an infinite gap increasing word. 
Then, for each n G one of the following statements is true: 

a) 10”! is a subword of w and w does not have a double gap of length n — 1, then 

AV(n) = 1, 
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b) lO""! is not a subword of w and w has a double gap of length n — 1, then A^/(n) = —1, 

c) 10"1 is a subword of w and w has a double gap of length n — 1, then /S? f{n) = 0 , 

d) 10”1 is not a subword of w and w does not have a double gap of length n — 1, then 
A2/(n) = 0. 

Proof. Clearly A/(n) = /(n + 1) — f{n) is the number of right special factors of w of 
length n. By the proof of Proposition 14.11 the number of right special factors of w of length 
n equals the number of subwords of w of form where x > 0, plus one. Thus 

= ^/(^ + 1 ) ~ ^/(^) is the difference between the number of subwords of w of form 
0 ^ 10 ”“^! and the number of subwords of w of form 

For any n G Z+, let C{n) denote the set of subwords of w of form 1. Then 

AVH = |C(n+ 1)1 - \C{n)\. 

For every v G C{n + 1), let (fiy) be the suffix of length n + 1 of v. If x 7 ^ 0, then 
0(O'^1O"'“'^1)) = G C{n). If X = 0, then 0(1O"'1) = 0"'l ^ C{n). We conclude 

that 0 maps all elements olC{n+ 1) but 10"'l (if it happens to be a subword of w) to C{n). 

For every element u = G C{n), exists if and only if is 

a subword of w. So does not exist if either u = is a subword of w or 

u = is a prehx of w, which happens when w has a double gap of length n — 1 . 

In case (a), when 10"'l is a subword of w and w does not have a double gap of length 
n — 1, 0“^ : C'(n) —> C(n + 1) is a well dehned function and C(n + 1) has one more element 
than C(n), so A^/(n) = 1 . 

In case (b) 0 : (^(n + 1) —>■ C{n) is an injective function and C'(n) has one more element 
than C{n + 1), so A^/(n) = — 1 . 

In case (c) 0 is a bijection, hence A^/(n) = 0. 

In case (d) 0 maps all but one element of C{n + 1) to C'(n) and 0“^ exists for all but 
one element of C'(n). Therefore the cardinalities of C{n + 1) and C'(n) are the same and 
A2/(n) = 0. □ 

Corollary 4.6 (Necessary condition for a function to be the subword complexity 
function of an infinite gap increasing word) . If f is a subword complexity function of 
an infinite gap increasing word, then /(I) = 2, /( 2 ) = 3 and |A^/(n)| < 1 . 

Proof. It follows from Lemma 14.51 that |A^/(n)| < 1. /(2) = 3 because 00, 01 and 10 are 
necessarily subwords of an inhnite gap increasing word, while 11 cannot be a subword of an 
inhnite gap increasing word. Indeed, if an inhnite binary words starts with 11, then the gap 
function g of this word satishes ^'(l) = 5 ^( 2 ) = 1 and thus g is not strictly increasing. □ 

Next we introduce some new terminology that will enable us to develop a method to hnd 
out if a given function is a subword complexity function of an inhnite gap increasing word. 
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It will be convenient to think of sets of positive integers as of strictly increasing sequences 
of positive integers. These sequences can be hnite, inhnite or even empty. We will make the 
following abuse of notation: if {xi}i>i and {yi}i>i are two strictly increasing sequences that 
have no elements in common, then i>i is the strictly increasing sequence that 

consists of all the elements of both sequences. If {yi}i>i is empty, then {xi}i>i U {yi}i>\ = 

Example 4.7. 

{2t}Z, U {2t - l}ti = {1, 2, 3,4, 5, 6 , 7, 8,10,12,14,16,18,...} = {z}ti U {2t}Z,. 

Definition 4.8. Let {xi}i>i be a strictly increasing sequence of positive integers and p < Xi 
be a non-negative integer. Then (Jp{{xi}i>i) will denote the strictly increasing sequence 
{p + Xi + 1, X1+X2 + 1, X 2 +X^ + 1, ...}. 

Example 4.9. 

aZ{2t}Zi) = {4, 7,11,15,...} = {4} U {4* + 3}“i. 

Theorem 4.10 (Necessary and sufficient condition for a function to be the sub¬ 
word complexity function of an infinite gap increasing word) . Let f : Z_|_ —> Z_|_ sat¬ 
isfy the necessary condition of Proposition^^ that is /(I) = 2, f{2) = 3 and |A^/(? 7 ,)| < 1 . 
Set {aj}j>i to the sequence obtained by arranging the elements of the set {n G Z_|_| A^/(? 7 ,) = 
1} in strictly increasing order. Similarly {hi}i>i is obtained by arranging the elements of 
{n G Z_|_|A^/(n) = —1} in strictly increasing order. Then f is the subword complexity func¬ 
tion of an infinite gap increasing word if and only if {ai}i>i is not empty and there exist an 
integer p, 0 <p < ai, and a strictly increasing sequence of positive integers with no 

elements in common with either {aj}i>i or {&*}*>!, such that {aj}j>i U {ci}j>i is infinite and 

LI {bZi>l L {cZi>l- (71) 


Proof. (—*>) Let / be the subword complexity function of the inhnite gap increasing word 
w = O'^lO^ilO^^lO-^^l..., where {ii]Zi is a strictly increasing sequence of positive integers and 
the integer q is in the range 0 < g < ji. We want to show that is not empty and that 

there exist an integer p, 0 < p < Oi, and a strictly increasing sequence of positive integers 
with no elements in common with either {aj}j>i or such that {aj}i>iU{cj}i>i 

is inhnite and eq. (ED is satished. 

Notice that 10^1 is a subword of w if and only if n is an element of Also w has a 

double gap of length n — 1 if and only if n — 1 = q + ji or n — 1 = ji +ji+i for some i, which is 
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equivalent to saying that n is an element of Let {cj}j>i be the strictly increasing 

sequence of integers n such that 10”! is a subword of w and w has a double gap of length 
n — 1. Clearly {ci}i>i has no elements in common with either {ai}i>i or It also 

follows from Lemma 031 that = {ai}i>i U {ci]i>i and = {hi]i>i U {ci}i>i. 

Set p = q. We proved that U is inhnite and eq. (ITT|) is satished. We still 

need to prove that {ai\i>i is not empty and 0 < p < ai. To prove both statements it is 
enough to prove that ji = oi. Suppose ji ^ ai, then ji = ci, which is impossible because 
{cj}j>i is a subsequence of and the hrst (and the least) element of o'q{{ji}'^i is 

p + ji + l> ji- 

(^) Let the function / satisfy the necessary condition of Proposition 14.11 and the se¬ 
quences and be as described in the hypothesis. Further suppose that 

is not empty and there exist an integer p, 0 < p < oi, and a strictly increasing sequence of 
positive integers with no elements in common with either or such 

that {aj}i>i U {cj}j>i is inhnite and eq. (ITT| is satished. We want to show that / is the 
subword complexity function of a gap increasing word. 

Denote the sequence {ai}i>i U by It follows from eq. (fTT|l that ji = oi. 

Thus p < ji and the inhnite word v = O^lO-^DO-^^lO-^^l... is gap increasing. Let fy be the 
subword complexity of v. We will show that fv = f- 

Since v is gap increasing, /.y(l) = 2 = /(I) and /^(2) = 3 = /(2). It is therefore enough 
to show that A^/^(n) = A^/(n) for all positive integers n. It follows from eq. (ITT| that 
all elements that and have in common form the sequence {ci}i>i. By 

T;emma f4.5l = 1 for all the elements of {aj}j>i, A^/^(6j) = —1 for all the elements 

of A‘^fy{n) = 0 for all positive integers n that are neither in nor in 

This means that A‘^fy{n) = A^f{n) for all positive integers n. □ 

Based on Theorem 14.101 we give a practical method of determining if a function / : ^ 

is the subword complexity function of an inhnite gap increasing word. 

From now on let’s hx a function / : such that /(I) = 2, /(2) = 3 and 

|A^/(u)| < 1. Further let’s assume that the sequences {ai}i>i and dehned in Theo¬ 
rem UHUl are already computed and is not empty. We need to hnd a way to determine 

if there exists an integer p, 0 < p < ai and a strictly increasing sequence of positive integers 
with no elements in common with either or {bi}i>i, such that eq. (fTT|l holds. 

The method involves testing every p, 0 < p < Oi. 

Fix some integer p, 0 < p < ai. There exists a sequence {ci}j>i that satishes the 
description above if and only if there exists a sequence U which 

satishes the following three properties: 

PI) ji = oi and is a subsequence of 

P2) is a subsequence of {p -h ji + l,ji + 42 + 1,42 + js + 1, •••}; 
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P3) Every element of the sequence {p + ji + 1, ji + j 2 + 1,^2 + js + 1, •••}, that is not 
an element of the sequence {bi}i>i, is an element of the sequence No element of the 

sequence {p + ji + 1, ji + j 2 + 1, j 2 + js + 1, •••} is an element of 

It follows from the proof of the Theorem 14. 101 that if the sequence {ji}“i with the prop¬ 
erties P1-P3 above exists, then the word w = 0^10^1 has the subword complexity 

function /. 

Proposition 4.11. Properties P1-P3 imply the following recursive construction of 
R1) The initial conditions: 
ji = Oi and is a subsequence o/{ji}^i; 

R2) The recurrence: 

Every element of that falls in the interval {as,as+i) fl {bt,bt+i) (if as is the last 

element of then {as,as+i) = {as,oo); if bt is the last element of then 

{bt, bt+i) = {bt, oo) ) is computed by: 

ji ji—s+t T ji—s+t—1 T !• 

T) The test: The sequence that is recursively computed using R1-R2 satisfies 

properties P1-P3 if and only if {bi}i>i is a subsequence of aq{{ji}°3^) = {p +ji + 1, ji +12 + 
1,12 +13 + 1 , •••} and has no elements in common with o'g({jj}“i). 

Corollary 4.12. Let f he the subword complexity function of an infinite gap increasing word 
w with prefix 0^1 for some nonnegative integer p. Then w is the only gap increasing word 
with prefix 0^1 and subword complexity function f. 

Proof. For a hxed function / (and therefore hxed and and a hxed p, the 

recursive construction given in Proposition 14.ll| gives a unique sequence {ji}“i. Thus there 
is a unique gap increasing word O^IO-^^IO-^^IO-^®!... with the subword complexity function 
/• □ 

Remark 4.13. By Corollary \4.12[ for any f : Z_|_ —>■ Z_|_ and any nonnegative integer p there 
is at most one infinite gap increasing word with prefix 0^1 with subword complexity function 
f. However it is not true in general that a function f : Z_|_ —>■ Z_|_ cannot he the subword 
complexity function of two distinct infinite gap increasing words. 

Consider the following two gap increasing words: 

u = 010^10^10^10^^10^^.. 

with gap function g^ defined by gu{^) = 2, (7^(2) = 4 and gu{i) = gu{'i — 2) I- gu{i — 1) for 
i > 3, and 

V = O^IO^IO^IO^^IO^^IO^®... 
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with gap function defined by gv{l) = 3, (7^(2) = 4 and gfii) = gfii — 2) + g^ii — 1) for 
i > 3. 

To prove that u and v have the same subword complexity it suffices to prove that the pair 
of sequences {ai}i>i and {hi}i>i, that correspond to u, and the pair of sequences {a[}i>i and 
m^>i, that correspond to v, are the same. 

By Lemma |7~3| the sequences {aj}i>i and consist of only one element ai = a[ = 3; 

while the sequences and are empty. 

Hence u and v have the same subword complexity function f{n) = + ^/(^); where 

^f{i) = 1 fori < 3 and A/(i) = 2 fori > 4. Thus f{n) = n+1 forn < 4 and f{n) = 2n —3 
for n > 5. 

The next two examples illustrate how one can use Proposition 14.111 to determine if a 
function / : Z_|_ —>■ Z_|_ is the subword complexity function of an inhnite gap increasing word. 

Example 4.14. We want to find out if there exists an integer q > 1 for which 


fin) 


n 

+ 1)(" - f 

n 

N. 

2 

N. 


+ 1 


is a subword complexity function of an infinite gap increasing word. 

First we check that the necessary condition of Proposition ^ ■ f\ are satisfied. Indeed /(I) = 
2, /(2) = 3 and 

77 - 1-1 

Afin)=fin + l)-f{n)= —— . 

q 

Next we find the sequences and {hi}i>i defined in Theorem \4.1U[ The sequence {bi}i>i 

is empty. The sequence is given by Oi = qi — 1. 

Next we compute the sequence using the recurrence in Proposition \4.11\ For anyp, 

0 < p < q-2, = {g-l,p+g,2g-l,p+2g,3g-l,p+3g,4g-l,p+4g,5g-l,p+5g,...}. 

By Proposition \4-.l 1\ the function f{n) is the subword complexity function of a gap increasing 
word because satisfies test T, that is = {qi — l}fli has no elements in common 

with o'q{{ji}'^fi) = {p + q,p + 2q,p + 3q,p + Aq^p + 5q,...} for any p, 0 < p < q — 2. 

This shows that for any integer q > 1 the function 


fin) = 


n 

+ 1)(’^ - ? 

n 

N. 

2 

N. 


) + l 


is the subword complexity function of exactly q — 1 distinct infinite gap increasing words. 
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Example 4.15. We want to find out if there exists an integer q > 2 for which 


f{n) 



n — 1 

q 


+ (n-l-l) 


n — 1 

q 


+ 1 


is a subword complexity function of an infinite gap increasing word. 
One can check that /(I) = 2, /(2) = 3 and 


^f{n) = f{n + l) - f{n) 


n — 1 

q 


+ 1 . 


Next we find the sequences {ai}i>i and defined in Theorem \4.1C\ The sequence 

is empty. The sequence is given by Oi = qi. 

We intend to check if the sequence that is computed using R1-R2 in Proposi¬ 
tion nm also satisfies the test T for at least one p, 0 < p < q — 1. For any p, = 

{g,p + g + 1, 2g,p + 2g + 2, 3g,p + 3g + 2,4g,p + 4g + 3, 5g,p + 5g + 3, To satisfy test 
T the sequence = {qi}°Zi should not have elements in common with o'q{{ji}°Zi) = 

{p + iq + ri}°Zi, where is the non-decreasing sequence in which every positive integer 

n occurs 2'^~^ times, that is {ri}'^^ = {1, 2, 2, 3, 3, 3, 3,4,4,4,4,4,4,4,4, 5, Fork large 
enough (more precisely k = Tk = q — p and thus the kf^ element of aq{{ji}°Zi) 

p + kq + q — p = {k + l)g = a^+i. Thus the test is not satisfied for any p, 0<p<g — 1. 

This shows that there is no integer q > 2 for which 


fip) 



n — 1 

q 


+ (n-l-l) 


n — 1 

q 


+ 1 


is a subword complexity function of an infinite gap increasing word. 

Remark 4.16. By Remark \4.1d[ and Example \4-14\ there can exist several different infinite 
gap increasing words with the same subword complexity function f. However we conjecture 
that, if there exists an n such that A^/(n) = —1, then there exists at most one infinite gap 
increasing word with subword complexity function f. 


5 A geometric representation of the subword complex¬ 
ity of gap increasing words 

In this section we find a geometric representation of the subword complexity function of a 
gap increasing word, which allows us to compute for each positive integer n the exact upper 
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bound of the subword complexity function fw{n) over all infinite gap increasing words w. It 
will turn out that for all n the value of fw{n) is maximized by the same w. 

We say that an inhnite word w has subword complexity (j){n) ultimately if there exists 
a positive integer N, such that for all integers n > N, the subword complexity function of 
w is 4>{n). In this section we determine for which integers a and b there is an inhnite gap 
increasing word w with the subword complexity function an + b ultimately. 

Theorem 5.1. Let w = 0”° 10”^ 10”^ 10”^... be an infinite gap increasing word. Let be the 
subword complexity function of w. For a fixed n > ni, let the partition z/„ he the partition 
whose all parts are all + 1, such that rij + 1 < n. In drawing the diagram of partition 1/^ 
we adopt the French convention, that is the bottom row is the longest row and the left-most 
column is the longest column. Let r{n) + 1 6e the number of parts of Un. The diagram of Vn 
is contained in the diagram of {n + which is an (n + 1) x (r(n) + 1) rectangle (as 

shown in fig. W- 

Consider the boundary line between z/„ and its complement in ((n + (marked 

thickly in fig. Qp. Index the rows of the diagram of {{n + from 0 (top) to r{n) 

(bottom). Let l{n) be the maximum row index (if it exists) for which the portion of the 
{l{n) — 1)*^ row to the left of the boundary line is less than the portion of the l{nf^ row to 
the right of the boundary line. If such row does not exist set l{n) = 0. Ifl{n) > 0, shade the 
portion of the diagram of Un above the row and, ifl{n) < r{n), shade the portion of the 

complement ofvn in below the l{nY^ row. Then /.u,(n+l) = n-\-2-\-shaded area. 



n+l 


Figure 1: The diagram of Un (bounded by the thick line) in the (n + 1) x (r(n) + 1) rectangle 


Remark: for n < Ui the subword complexity function of w is fw{n + 1) = n + 2. 
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Proof. It follows from Proposition Kf.21 that the number of right special factors of w of length 
k > riQ is Sw{k) = 1 + r{k) — l{k), where r{k) is the number of i > 1 such that rii < k — 1 and 
l{k) is the number of z > 1 such that rii-i + rii < k — 2 (it will be shown later that l{k) can 
be dehned the way it was dehned in the hypothesis of Theorem EH). The (n + l)th subword 
complexity of w is 

n n 

f^{n + 1) = /^(l) + '^s^{k) = 2 + ^(1 + r{k) - l{k)) = 

k=l k=l 


r{n) l{n) 

n + 2 + - Hi) - ^(n - nj_i - n* - 1) = 

i=l i=l 


Thus 


/(n) r(n) 

ra + 2 + + 1) + ^ {n-ni). 

l(n) — l r(n) 

/^„(n + 1) = n + 2 + ^ (rzi + 1) + ^ (n-rii). 

i=0 i=l(n)+l 


( 12 ) 


Consider the partition z/„ = (no + 1, ni + 1, nr(n) + 1), where all parts n* + 1 < n. By 
eq. MU (n + 1) equals n + 2 plus the area of the hgure that consists of the portion of 
above the /(n)th row and the portion of the complement of in ((n + below the 

/(n)th row (shaded area in £g. Q). Clearly /(n) = 0 if no + 1 > n — ni. Otherwise l{n) is 
maximal with property n;(„)_i + 1 < n — ri^n)- In the last case we can dehne l{n) in terms 
of the diagram as the maximum integer for which the part of z/„ in the {l{n) — 1)*^ row is 
less than the part of the complement of in the l{ny^ row. □ 


Lemma 5.2. The exact upper hound on the subword complexity function f{n) of an infinite 
gap increasing word is |'n/2] [n/2j + |'n/2] +1. 

Proof. Clearly the shaded area in £g. Q] is maximized by the gap increasing word w with 
Ui = i for z > 0. The gap function of w is g{n) = n and it follows from Lemma ld.5l that 
f(n+2)=f(n)+n+2. Solving this recurrence with initial conditions f(l)=2 and f(2)=3, we get 


f{n) = [n/2] [n/2j + [n/2] + 1. 


□ 
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The diagram of partition in ((n + (fig. [T]), used to compute fw{n + 1), is a 

representation of a prefix of w of length equal to \vn\ (fhe weight of Vn). We will call such a 
diagram a gap increasing prehx diagram. 

A diagram of any partition p with distinct parts, in a rectangle with the vertical side of 
length equal to the number of parts in /i and with the horizontal side of length greater then 
the largest part of p, can be thought as a gap increasing prehx diagram. Let k be the length 
of the horizontal side of the rectangle. Theorem 15.11 gives a method for computing the 
value of the subword complexity function of some inhnite gap increasing word whose prehx 
is represented by fi and whose suffix (one that follows after the prehx represented by p) does 
not contain subwords 10"'l for n < k — 1. We will call this value the complexity f{k) of the 
gap increasing prehx diagram. 

Theorem 5.3. Let a and b be two integers. There exists a gap increasing word w with the 
subword complexity function fw{n) = an + b ultimately if and only if a >2. 

Proof. ( —>■ ) Let a > 2 and b be two integers. To show the existence of an inhnite gap 
increasing word w with the subword complexity an + b ultimately, we hrst build a prehx of 
w and then show the recursive construction of the inhnite suffix of w. 

Our hrst goal is to build a prehx v oiw with the following property: there exists a positive 
integer K, such that any word vu, where u is an inhnite suffix that makes vu gap increasing 
and u has no subwords 10"'l with n < K — 1, has Kth subword complexity fuv{K) = aK + b 
and the number of right special factors of vu of length K — 1 is Suv{K — 1) = a. To build v, it 
is enough to construct its gap increasing diagram D with complexity foiK) = aK + b such 
that the diagram obtained by deleting the right-most column of D has complexity f£i{K) — a. 

Since a > 1, there exists an integer ko > 2a + 4 such that k^ + a"^ < ako -|- b. The diagram 
of partition (1, 2, 3,..., a —1, a, [fco/2j , ko — a + 1, ko — a + 2, ko — a + 3, ..., fco —1) with 2a parts, 
in a rectangle with the vertical side of length 2a and the horizontal side of length ko, is a gap 
increasing partition diagram, which we denote by Dq. By Theorem 15.11 the k(f^ complexity 
foi^ko) of this digram is fco +1 plus the area of the shaded part of the upper diagram in £g. |21 
so fo{ko) = ko + a"^ < ako + b. 

Starting with the initial diagram Do, we will modify the diagram according to the fol¬ 
lowing algorithm. At each odd step we increase k, the length of the horizontal side of the 
rectangle, by one and add one to each part of the partition in the diagram (see £g. I2D. At 
each even step we increase k by one (in £g. n. Let Di denote the diagram obtained after 
i steps of the algorithm and fi denote its complexity. After each odd step of the algorithm 
the shaded area in the diagram increases by a — 1 (the complexity of the diagram increases 
by a), after each even step of the algorithm the shaded area in the diagram increases by a 
(the complexity of the diagram increases by a -|- 1). Thus f 2 i{ko + 2i) = /o(fco) + *(2a -|- 1). 
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Since /o(fco) < ak^ + b, the integer zq = afco + & —/o(^o) is positive. The complexity of D 2 iQ 
is f 2 io{ko + 2zo) = fo{ko) + 2zoa + *o = /o(^o) + ‘2ioa + ako + b — /o(fco) = + 2zo) + b. Let 

K = ko + 2io. The diagram D 2 iQ has complexity / 2 ip(iL) = aK + b. The diagram obtained 
by deleting the right-most column of D 2 iQ has the {K — 1)*^ complexity equal to f 2 io{K) — a. 

The diagram D 2 iQ corresponds to a gap increasing prehx v with the following property: 
any gap increasing word w = vu, such that u has no subwords 10”'! with n < K — 1, has 
subword complexity fw{.K) = aK + b and — 1) = a. 

Next we will show that there exists u such that w = vu is gap increasing and fw{n) = 
an + b for all n > K. By the construction above v = for some 

no < ni < ... < n 2 a-i- Let u = 0"'^“10"2“+U0"2“+^1..., where n* = ni-a + ni-a+i -l- 1 for all 
i > 2a. Since n 2 a+i = ''^a + ^a+i + 1 > K — 1, 10"! with n < K — 1 is not a subword of u. Thus 
w = vu has iLth subword complexity fw{K) = aK+b. By Lemma lT31 s^(n) = s^(iL—1) = a 
for all n > K. This implies that /«,(n) = an + b for all n > K. 

( <— ) At last we have to show that there does not exist an inhnite gap increasing word 
with subword complexity f{n) = n + b ultimately. 

Suppose there exists an inhnite gap increasing word w with subword complexity function 
fw and a positive integer N such that fw{n) = n + b for n > N. For any integers m and n 
such that n > m > N, fw{n) — = n — m + 5, where 5 is the difference between the 

number of distinct subwords of length n and m that contain at least two I’s. For n large 
enough 5 > 0. This proves the claim of the theorem. □ 


6 Blockwise injective words 

Definition 6.1. A function h : N —> N is called blockwise injective if h{i) = h{j) for i < j 
implies h{i) = h{i -|- 1) = ... = h(j). 

Notice that the set of blockwise injective functions includes the set of injective functions 
and the set of non-decreasing functions. 

We will generalize the method of Proposition Id.21 to get a recurrence formula for the sub¬ 
word complexity function of an inhnite word w whose gap function g is blockwise injective. 
Here we will have to distinguish between unbounded and bounded blockwise injective func¬ 
tions. To each unbounded blockwise injective function : N ^ M we assign two functions 
j : N —> N and p : N —>■ N such that j is the injective function which assumes the same 
values and in the same order as g does, and, for each r G N, p(r) is the number of times 
g assumes value j(r). If p : N —> N is a bounded blockwise injective function, let b be the 
number of distinct values that g assumes. Then j{r) is dehned as above for 1 < r < 6, and 
p{r) is dehned for 1 < r < 6 — 1. 
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Proposition 6.2. Let w be an infinite binary word whose gap function g is blockwise injec¬ 
tive. Let j{r) andpir) be defined as above. Consider the following four systems of eguations 
(with the convention that j(0) = 0/- 

j(r) <n< i{r) + max{j{r — 1), [sign{p{r) — l)]j(r)}, (13) 


{ n > fir) < j{r') 

n < j{r) + max{j{r — 1), [sign{p{r) — l)]j(r)} (14) 

n < j{r) + max{j{r' — 1), [sign{p{r') — l)]j(r')}, 


fir + 1) > jir) 

2j(?") <n < p{r)j{r) min{j{r — l),j(r)}, 


(15) 


j(r + 1) < j{r) 

j{rj + j(r + 1) <n< [p{r) — l]j(r) + j(r + 1) + min{j{r — 1), j(r)}. 


(16) 


1) If g is unbounded, then the number of right special factors of length n of w eguals one 

plus the sum of the numbers of r solutions of eguations (03), and (II 3 . 

2) If g is bounded, let b be the number of distinct values that g assumes and jmax = 
max{i (r)}. Then the number of right special factors of length n of w is I (n) + Si {n) + S 2 in), 
where 

j( \ fl) 'If^ jmax 
1 ^ 0 , ^ ^ Jmax') 

Si is the number of r for which there exists an S >1 such that eq. a is satisfied, S 2 is 
the sum of the numbers of r solutions of eg. (03) and (03)- 


Proof. 1) First we consider the case when g is unbounded. 

The word 0” is a right special factor of w for any integer n > 1 (this is the one which is 
not counted by the solutions of the system). 

We will show that the number of right special factors of length n of w which contain 
exactly one 1 equals the number of integer r solutions of eq. m- A binary word which 
contains exactly one 1 is a right special factor of w if and only if it has the form 
where r > 1 and k satishes one of the following: 

(a) p{rj = 1 and 0 < A; < j(r — 1). 

(b) p{r) > 1 and 0 < /c < max{j{r — 1), j(r)}. (Reminder: j(0) = 0) 
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The number of subwords of w of length n of form 0^10-^^'’^ ^ which satisfy conditions (a) 
and (b) equals the number of r’s which satisfy equations (ITK|l and (uni) respectively. 


p{r) = 1 

0 < n — j(r) < j{r — 1), 
p{r) > 1 

0 < n — j(r) < max{j{r — 1), j(r)}. 


(18) 


(19) 


Equations m and (HI have disjoint sets of r solutions and eq. m combines the 
solutions of both, which explains why the number of solutions of eq. m is the same as the 
number of right special factors of length n of w which contain exactly one 1. 

Next we shall count the number of right special factors of length n ofw which contain at 
least two I’s. Here we use the convention that if m is a hnite word, then vP = e (the empty 
word). 

If a right special factor of w contains at least three I’s, the number of O’s between every 
two consecutive I’s in this right special factor should be the same. Indeed, since g is blockwise 
injective, any subword v of w which has lO^lOH (s 7^ t) as a subword, can occur only once 
in to, thus V cannot be a right special factor. Therefore a right special factor of w which 
contains at least two I’s is necessarily, but not sufficiently, of form (PU|) or (EH). 


0fc(i0h’')-i)"^10th+b-i^ k>0, r>l, l<m< p{r). (21) 

We will show that the number of right special factors of length n of form EOD equals 
the number of r solutions of eq. m, and the number of right special factors of length n of 
form EH) equals the number of r solutions of eq. EH- 

First we shall count the number of right special factors of length n and form (OU|) . Let 
V = 0^(10.^^’’)“^)”’' be a subword of w, where fc>0, r>l, 2<m< p{r). The subword v is 
followed by 1 in its leftmost occurrence in tc. It is followed by 0 (which can happen only in 
its rightmost occurrence) if and only if 

i{r + l)> j{r). ( 22 ) 

Hence a binary word of form (EH is a special factor of w if and only if it is a subword 
of w and ()22|1 holds. For a fixed r there exists a subword of w of form (EH and length n, if 
and only if 

2j(r’) <n< p{r)j{r) + min{j{r — 1), j(r)}. (23) 
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It should be mentioned, that for every r that satishes eq. (ESI), there exists a unique 
subword of w of form ESI) and length n. 

Notice that system m is a combination of equations (1^ and ESI), and there is a bijec- 
tive correspondence between right special factors of length n and form (ESI) and r solutions 
of system m- 

At last we shall count the number of right special factors of length n of form (I21|l . Let 
u = be a subword of tn, where /c>0, r>l, l<m< p{r). The 

subword u is followed by 1 in its rightmost occurrence in tn. It is followed by 0 (which can 
happen only in its leftmost occurrence) if and only if 

]{r + l)<j{r). (24) 

Hence a subword of w of form EH) is a right special factor if and only if El holds. For 
a hxed r there exists a subword of w of form EH) and length n (and there can be only one 
such subword) if and only if 

i{r) + i{r + 1) < n < [p(r) — \\j{r) + j(r + 1) +min{i{r — l),j(r)}. (25) 

Thus for every r there exists a right special factor of form EH and length n (this special 
factor happens to be unique) if and only if r satishes system (llb|) . That proves that the right 
special factors of w of length n are counted by the number of r solutions of system El- 

2) Next we consider the case when g is bounded. 

It is clear that 0"^ is a right special factor of w if and only if n < jmax, this accounts for 
I{n). The number of right special factors of length n which contain at least two I’s equals 
S'2, the argument is the same as in the case when g is unbounded. 

At last we have to show that the number of right special factors of length n which contain 
exactly one 1 equals the number of r’s for which there exists an r' > 1 such that eq. El is 
satished. 

Let u be a binary word of length n which contains exactly one 1, then u is a right special 
factor of w if and only if u = and uO = is a subword of w, that is 

0 < n — j(r) < max{j{r — 1), [sign{p{r) — l)]j(r)}. 
and there is r' > 1 such that 

j(r') > j(r) 

~ j{'^) < fnax{j{r' — 1), [sign{p{r') — l)]j(r')}. 

Thus the number of right special factors of w which contain exactly one 1 equals the 
number of r solutions of eq. ()2b|l for which there is r' > 1 such that eq. EH is satished and, 
because eq. El is obtained by combining eq. (ESI) and eq. (El, the claim is proved. □ 


(26) 

(27) 
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Corollary 6.3. Letw he an infinite binary word whose gap function g is non-decreasing and 
iir), p{r) (and b if g is bounded) be defined as before. Let s^{n) denote the number of right 
special factors of length n of w. 

If g is unbounded, then, for every natural n, Sw{n) eguals one plus the number of integer 
r solutions of 


j{r) < n 

j{r — 1) + p{r)j{r) > n + 1. 


(28) 


If g is hounded, for every natural number n consider the ineguality system 


{ r < b 

j{r) < n (29) 

j{r — 1) + p{r)j{r) > n + 1. 

If I < n < j{h) — 1, then Sw{n) eguals one plus the number of integer r solutions of eg. /f^) . 
If n > jib), then Syj{n) is just the number of integer solutions of eg. /f^) . 

Proof. First consider the case when g is nnbonnded. By Proposition 16.21 s..,(n) eqnals one 
pins the nnmber of r’s which satisfy one of the following: 


p{r) = 1 

0 < n < j(r — 1) + j(r), 


(30) 


p(r) > 1 
0 < n < 2j(r), 


(31) 


2j(T’) <n< p{r)j{r) + j(r — 1). (32) 

The set of r solntions of system (OH|) is the disjoint nnion of the sets of r solntions of 
eqnations dSHD, (EH) and E^ - 

Next we consider the case when g is bonnded. Becanse g is non-decreasing, if r satisfies 
eq. (d, then r and r' = r -f 1 satisfy eq. (HI. Thns the set of r’s which satisfy of eq. (d is 
the same set that satishes eq. (USD. By Proposition ISiSl the nnmber of right special factors of 
w of length n eqnals the nnmber of r solntions of eq. ()28|1 with the restriction r < b (becanse 
p(r) is dehned only for r < b) pins I{n), where 


J(n) 


1, if n < j{b) — 1 
0, if n > j{b). 


□ 
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Example 6.4. Consider the infinite binary word whose non-decreasing gap function is given 
j{r) = r + 1 and p{r) = r. 

w = 0(102)2(10^)3(10^)^.. 

By Corolla,rv \(i.tA the number of right special factors of length n of w eguals the number 
of solutions of the system 

{ r + 1 < n 
r + r(r + 1 ) > n + 1 . 

This system has n — \^/n — 2\ solutions, thus 

f^{n + 1 ) = f^o{n) + n- \y/n - 2 | . 

Proposition 6.5. Let w be an infinite binary word with gap function g and subword com¬ 
plexity function f. If g is non-decreasing, then 0 < Af{n) < n+1. If g is blockwise injective, 
then 0 < Af{n) < n{n — l)/2 + 1. 

Proof. The proof is similar to the proof of Proposition 14.11 A/(n) is the number of right 
special factors of length n of w. 

If g is non-decreasing, Af{n) is at least zero (A/(n) = 0 for some n if and only if w is 
ultimately periodic, which happens if and only if g is bounded). Also ^f{n) is at most the 
number of subwords of w of form 0 ^( 10 ^)”^!, where k,l,m > 0, k < I and k -\- [I -\- l)m = n 
(this follows from the proof of Proposition 16.2|) . There are at most n -|- 1 such subwords 
because I completely determines the subword 0 ^( 10 ^)™! and there are n -|- 1 choices for Z, 
hence 0 < Af{n) < n d- 1 . 

If g is blockwise injective, A/(n) is at least zero and at most the number of subwords 
of w of form 0^(10^)™'(10^)*1, where k,l,m,r > 0, I 7 ^ r, k -\- {I + l)m -|- (r -|- l)i = n, 
i = 0 if k = n, and i = 1 otherwise (again follows from the proof of Proposition I6.2|l . 
There are at most n{n -|- l)/2 + 1 such subwords because I and r determine the subword 
0^(10^)”^(10'’)*1 completely (except for the case when k = n) and 0<Z-|-r<n — 1, thus 
there are at most n{n -|- l)/2 such pairs of I and r (add one for the case k = n). Hence 
0< A/(n) <n(n + l)/2 + l. □ 

Corollary 6.6. Let w be an infinite binary word with a blockwise injective gap function and 
subword complexity function f. Then I < f{n) < ^ ^ 1. 

Proposition 6.7. Let a and b be two integers. There exists an infinite binary word w with a 
blockwise injective gap function and subword complexity function fw{n) = an -\- b ultimately 
if and only if one of the two following conditions holds: 

1) a>2. 

2) a = f) and h>l. 
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Proof. It was proved in Theorem 15.hi that for any integers a > 2 and b there exists a gap 
increasing word w with snbword complexity an + b ultimately. Since w is gap increasing, the 
subword complexity function of w is blockwise injective. 

For any integer b > 2 the binary inhnite word Wb = (10^“^)°° has the subword complexity 
f{n) = b for all n > 6 — 1. Clearly the gap function of wt is blockwise injective. 

At last we have to show that an inhnite binary word, whose gap function is blockwise 
injective, cannot have subword complexity f{n) = n + b ultimately. An inhnite word w 
whose gap function is blockwise injective is either ultimately periodic or 0 * 10 -^ is a subword 
of w for all i and j. If tc is ultimately periodic, then its subword complexity function is 
ultimately constant and the claim is proved. If 0*10-^ is a subword of w for all i and j, then 
w contains exactly n + 1 distinct subwords of length n that contain at most one 1. Thus for 
any m < n, fwip) — fwijn) = n — m + 6m,n, where 5m,n is the diherence between the number 
of distinct subwords of length n and m that contain at least two I’s. To every subword v 
of w of length m that contains at least two I’s we can put in correspondence the subword 
M of w of length n with prehx v (this u contains at least two I’s). For any m, there exists 
n large enough such that is a subword of w whose prehx of length m contains only 

one 1, hence 6m,n > 1- This means that there does not exist m such that for all n > m the 
subword complexity of tc is /(n) = n + 6. □ 
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